Bluestein's FFT for arbitrary N on the hypercube

Authors

  • Paul N. Swarztrauber
  • Roland A. Sweet
  • William L. Briggs
  • Van Emden Henson
  • James S. Otto
Abstract

The original Cooley-Tukey FFT was published in 1965 and presented for sequences with length N equal to a power of two. However, in the same paper they noted that their algorithm could be generalized to composite N, in which the length of the sequence was a product of small primes. In 1967, Bergland presented an algorithm for composite N, and variants of his mixed radix FFT are currently in wide use. In 1968, Bluestein presented an FFT for arbitrary N, including large primes. However, for composite N, Bluestein's FFT was not competitive with Bergland's FFT. Since it is usually possible to select a composite N, Bluestein's FFT did not receive much attention. Nevertheless, because of its minimal communication requirements, the Bluestein FFT may be the algorithm of choice on multiprocessors, particularly those with the hypercube architecture. In contrast to the mixed radix FFT, the communication pattern of the Bluestein FFT maps quite well onto the hypercube. With P = 2^d processors, an ordered Bluestein FFT requires 2d communication cycles with packet length N/2P, which is comparable to the requirements of a power of two FFT. For fine-grain computations, the Bluestein FFT requires 20 log_2 N computational cycles. Although this is double that required for a mixed radix FFT, the Bluestein FFT may nevertheless be preferred because of its lower communication costs. For most values of N it is also shown to be superior to another alternative, namely parallel matrix multiplication.

* Computational Mathematics Group, Department of Mathematics, University of Colorado at Denver, Denver, CO 80204.
† This work was completed while this author was visiting the University of Colorado at Denver.
‡ National Center for Atmospheric Research, Boulder, Colorado 80307, which is sponsored by the National Science Foundation.
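The core idea of a Bluestein-style FFT for arbitrary N can be illustrated with a minimal single-processor sketch. It is not the hypercube implementation described in the abstract and makes no claim about the paper's communication scheme. It relies on the standard identity nk = (n^2 + k^2 - (k - n)^2)/2, which turns a length-N DFT into a convolution with a "chirp" sequence; that convolution is then evaluated with power-of-two FFTs of length M >= 2N - 1. The function names `fft_pow2`, `bluestein_dft`, and `naive_dft` are hypothetical and chosen only for this illustration.

```python
import cmath


def fft_pow2(a, inverse=False):
    """Recursive radix-2 FFT (unnormalized); len(a) must be a power of two."""
    n = len(a)
    if n == 1:
        return list(a)
    even = fft_pow2(a[0::2], inverse)
    odd = fft_pow2(a[1::2], inverse)
    sign = 1 if inverse else -1
    out = [0j] * n
    for k in range(n // 2):
        t = cmath.exp(sign * 2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + t
        out[k + n // 2] = even[k] - t
    return out


def bluestein_dft(x):
    """DFT of arbitrary length N via chirp convolution (illustrative sketch)."""
    n = len(x)
    if n == 0:
        return []
    # chirp[k] = exp(-i*pi*k^2/N); k^2 is reduced mod 2N to limit rounding error.
    chirp = [cmath.exp(-1j * cmath.pi * ((k * k) % (2 * n)) / n) for k in range(n)]
    # Power-of-two convolution length, at least 2N - 1 to avoid wrap-around.
    m = 1 << (2 * n - 1).bit_length()
    # a: input multiplied by the chirp, zero-padded to length m.
    a = [x[k] * chirp[k] for k in range(n)] + [0j] * (m - n)
    # b: conjugate chirp at indices -(N-1)..(N-1), negative indices wrapped.
    b = [0j] * m
    for k in range(n):
        b[k] = chirp[k].conjugate()
        if k:
            b[m - k] = chirp[k].conjugate()
    # Circular convolution through three power-of-two FFTs.
    fa = fft_pow2(a)
    fb = fft_pow2(b)
    conv = fft_pow2([p * q for p, q in zip(fa, fb)], inverse=True)
    # Normalize the inverse transform and apply the final chirp factor.
    return [chirp[k] * conv[k] / m for k in range(n)]


def naive_dft(x):
    """O(N^2) reference DFT, used only to check the sketch."""
    n = len(x)
    return [sum(x[j] * cmath.exp(-2j * cmath.pi * j * k / n) for j in range(n))
            for k in range(n)]


if __name__ == "__main__":
    data = [1, 2, 3, 4, 5, 6, 7]  # N = 7, a prime length
    err = max(abs(p - q) for p, q in zip(bluestein_dft(data), naive_dft(data)))
    print("max abs difference vs. naive DFT:", err)
```

In this sketch all transforms that do real work have power-of-two length, which is consistent with the abstract's remark that the Bluestein FFT has requirements comparable to a power of two FFT; the price is roughly three such transforms of length at least 2N - 1 instead of one of length N.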


Related resources

Fusion of Panchromatic and Multispectral Images Using Non Subsampled Contourlet Transform and FFT Based Spectral Histogram (RESEARCH NOTE)

Image fusion is a method for obtaining a highly informative image by merging the relative information of an object obtained from two or more image sources of the same scene. The satellite cameras give a single band panchromatic (PAN) image with high spatial information and multispectral (MS) image with more spectral information. The problem exists today is either PAN or MS image is available fr...

Multiprocessor FFTs

Several multiprocessor FFTs are developed in this paper for both vector multiprocessors with shared memory and the hypercube. Two FFTs for vector multiprocessors are given that compute an ordered transform and have a stride of one except for a single "link" step. Since multiple FFTs provide additional options for both vectorization and distribution we show that a single FFT can be performed in ...

The Complexity of FFT and Related Butterfly Algorithms on Meshes and Hypermeshes

Parallel FFT data-flow graphs based on a Butterfly graph followed by a bit-reversal permutation are known, as are optimal-order embeddings of these flow-graphs onto meshes and hypercubes. Embeddings onto a 2D mesh require O(sqrtN) data transfer steps and O(logN) computation steps. Embeddings onto a hypercube require O(logN) data transfer steps and O(logN) computation steps. A similar FFT algori...

Efficient FFT on Torus Multicomputers: A Performance Study

In this paper, the problem of computing a one-dimensional FFT on a c-dimensional torus multicomputer is focused. Different approaches are proposed which differ in the way they use the interconnection network of the torus. One of the approaches is based on the multidimensional index mapping technique for FFT computation. A second approach is based on embedding on the torus a hypercube algorithm ...

Matchings Extend to Perfect Matchings on Hypercube Networks

In this work, we investigate the problem of perfect matchings with prescribed matchings in the n-dimensional hypercube network Q_n. We obtain the following contributions: Any arbitrary matching with at most n − 1 edges can be extended to a perfect matching of Q_n for n ≥ 1. Furthermore, any arbitrary non-forbidden matching with n edges can also be extended to a perfect matching...

Journal:
  • Parallel Computing

Volume 17

Publication date: 1991